Talking Face Generation


Talking face generation is the task of synthesizing video of a person speaking, driven by an audio recording of their voice.

LPIPS-AttnWav2Lip: Generic Audio-Driven lip synchronization for Talking Head Generation in the Wild

Jan 30, 2026

MIRRORTALK: Forging Personalized Avatars Via Disentangled Style and Hierarchical Motion Control

Jan 30, 2026

Generalizable and Animatable 3D Full-Head Gaussian Avatar from a Single Image

Jan 19, 2026

RSATalker: Realistic Socially-Aware Talking Head Generation for Multi-Turn Conversation

Jan 15, 2026

Now You See Me, Now You Don't: A Unified Framework for Expression Consistent Anonymization in Talking Head Videos

Jan 14, 2026

Efficient and Robust Video Defense Framework against 3D-field Personalized Talking Face

Dec 24, 2025

FacEDiT: Unified Talking Face Editing and Generation via Facial Motion Infilling

Dec 16, 2025

ActAvatar: Temporally-Aware Precise Action Control for Talking Avatars

Dec 22, 2025

TAVID: Text-Driven Audio-Visual Interactive Dialogue Generation

Dec 23, 2025

VASA-3D: Lifelike Audio-Driven Gaussian Head Avatars from a Single Image

Dec 16, 2025